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RELATIONSHIPS  AMONG  AN  INDIVIDUAL  INTELLIGENCE  TEST 
AND  TWO  AIR  FORCE  SCREENING  AND  SELECTION  TESTS 


?.  INTRODUCTION 

The  aptitude  assessment  program  of  a  military 
organization  largely  defines  the  effectiveness  of 
the  organization  in  satisfying  its  manpower  needs. 
The  initial  scree.iing  and  subsequent  classification 
of  applicants  must  occur  in  such  a  manner  as  to 
define  accurately  the  abilities  of  the  manpower 
resource,  while  maintaining  fair  and  unbiased 
selection  practices.  In  terms  of  the  individual, 
selection  and  classification  testing  determines  the 
areas  of  his  career  opportunity  which  later  affect 
his  motivation,  job  satisfaction,  and  career 
retention.  In  terms  of  the  military  organization, 
the  objective  of  the  selection  program  is  the 
economical  utilization  of  its  human  resources. 

To  optimize  the  usefulness  of  any  selection 
device,  an  effort  must  be  made  to  determine  its 
validity  in  the  particular  selection  process.  Jn 
addition  to  aptitude  test  validation,  much  current 
emphasis  has  also  focused  upon  developing  or 
maintaining  unbiased  or  “culture  free”  assessment. 
In  the  past,  enlistees,  both  draft  and  non-draft 
motivated,  produced  a  manpower  pool  with  a 
sufficiently  wide  range  in  aptitude  to  provide 
adequately  for  personnel  needs.  Now,  in  the 
absence  of  draft  pressure,  the  Air  Force  may  be 
restricted  in  its  manpower  input  and  must  insure 
that  its  selection  tests  tap  the  range  of  ability 
accurately,  particularly  in  minority  groups  where 
possible  test  bias  can  confound  test  results. 

To  meet  the  needs  of  the  enlisted  manpower 
pool,  the  two  main  aptitude  measures  utilized  by 
the  Air  Force  in  selection  and  assignment  are  the 
Airman  Qualifying  Examination  (AQE)  and  the 
Armed  Forces  Qualification  Test  (AFQT).  The 
AQE  was  adopted  for  use  by  the  Air  Force  in 
1958  (On  1  July  1973  the  Armed  Services  Voca¬ 
tional  Aptitude  Battery  (ASVAB)  replaced  the 
AQE  in  the  Air  Force  testing  inventory.  Like  the 
AQE,  xhe  ASVAB  will  provide  the  four  Apt-'tude 
Indexes.)  The  development  and  standardization  of 
the  last  AQE,  Form  J,  has  been  described  by 
Vitola,  Massey,  and  Wilbourn  (1971).  Admin¬ 
istered  and  scored  at  the  recruiting  level,  the  AQE 
yields  four  Aptitude  Indexes  (AIs)- Administrative 
(AQE-A),  Electronic  (AQE-E),  General  (AQE-G), 
and  Mechanical  (AQE-M).  These  AIs  have  been 
used  in  the  selective  recruiting  and  initial  assign¬ 
ment  of  basic  airmen.  Although  the  AQE  was 


originally  developed  as  a  military  classification 
tool,  in  1962  the  High  School  Testing  Program  was 
established  and  the  AQE  was  administered  to  large 
numbers  of  students  as  part  of  the  schools’ 
counseling  program. 

A  second  selection  instrument  administered  to 
all  pre-inductees  and  enlistees  is  the  AFQT.  (In 
late  1973  the  ASVAB  replaced  the  AFQT.  An 
AFQT  composite  was  drawn  from  ASVAB  sub¬ 
tests.)  This  Department  of  Defense  test 
simultaneously  evaluates  verbal,  numerical,  spatial, 
and  mechanical  abilities.  Its  single  composite  score 
is  primarily  used  to  determine  mental  categories 
and  service  eligibility  of  candidates  for  military 
service.  Administration  and  scoring  usually  occur 
at  the  Armed  Forces  Entrance  and  Examining 
Stations  (AFEES). 

A  majority  of  previous  research  has  foci>sed  on 
the  development  and  periodre  validation  of  the 
AQE  or.  samples  of  technical  school  graduates, 
although  several  studies  were  initiated  to  compare 
the  AQE  with  commercial  aptitude  batteries,  or  to 
investigate  the  extent  of  aptitude  test  differences 
among  various  subgroups  of  the  population. 
Considering  their  impact  upon  the  career 
aspirations  of  thousands  of  potential  Air  Force 
Recruits  and  upon  high  school  counseling,  such 
investigations  were  justified.  Madden,  Valentine, 
and  Tupes  (1966)  compared  the  AQE  with  the 
Differential  Aptitude  Tests  and  fourd  *hat  both 
batteries  measure  essentially  the  same  factors  or 
abilities.  Madden  and  Tupes  (1966)  described  the 
relationships  among  the  AQE,  the  California 
Achievement  Test,  and  the  Davis  Reading  Test  in 
the  estimation  of  reading  achievement.  Madden 
and  Valentine  (1967)  found  a  moderate  positive 
relationship  between  ability  measures  obtained 
from  the  AQE  and  the  Employee  Aptitude  Survey. 
From  these  studies,  the  AQE  compares  favorably 
with  other  aptitude  batteries. 

Comparative  studies  of  the  AFQT  by  the  Air 
Force  have  mainly  involved  other  Air  Force  ability 
tests,  such  as  the  Airman  Classification  Test 
(Thompson,  1958),  the  Airman  Proficiency  Tests 
(Brokaw,  1959),  and  the  AQE  (Valentine,  1968). 
It  should  be  noted  that  Anastasi  (1968)  lists  the 
AFQT  under  “Group  Intelligence  Tests  and 
Developmental  Scales”  (p.  640). 
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The  possibility  of  bias  against  minority  groups 
in  Air  Force  selection  instruments  has  recently 
been  investigated  by  Guinn,  Tupes,  and  Alley 
(1970a).  They  have  reported  a  study  exploring  the 
relationships  of  demographic  variables  to  several 
Air  Force  aptitude  tests.  They  found  that  a 
majority  of  test  scores  was  significantly  related  to 
race,  education  level,  and  geographic  area  of  the 
trainee.  Racial  differences  were  most  prominent  in 
tests  requiring  specific  prior  knowledge,  such  as 
verbal  and  mechanical  tests,  while  situationally 
defined  tests  measuring  non-verbal,  spatial,  and 
psychomotor  abilities  revealed  less  pronounced 
racial  effects.  A  second  study  by  Guinn,  Tupes, 
and  Alley  (1970b)  investigated  racial  bias  in  the 
AQE  as  a  predictor  of  final  school  grade  in  10  Air 
Force  technical  school  courses.  Bias  against 
minority  groups,  defined  as  underprediction  of 
final  school  grade,  was  not  found  in  any  of  the  10 
courses.  There  was,  however,  a  tendency  for  the 
AQE  to  overpredict  black  performance  in  six  of 
the  courses  (three  of  these  race  differences  were 
found  to  be  statistically  significant).  Possible  racial 
bias  in  the  AQE  was  also  investigated  by  Shore  and 
Marion  (1972)  who  employed  the  AQE  to  predict 
performance  on  the  Specialty  Knowledge  Test 
(SKT),  a  test  of  job  knowledge  used  in  deter¬ 
mining  an  airman’s  promotability  to  a  higher 
grade.  Bias,  defined  as  underprediction  of  the  SKT 
by  the  AQE,  was  not  found  in  the  1 6  career  areas 
investigated.  No  Air  Force  research  has  directly 
investigated  the  question  of  bias  against  minority 
group  members  in  the  AFQT. 

The  present  study  extends  the  exploration  of 
the  relationships  of  AQE  and  AFQT  scores  to 
other  ability  measures.  A  test  of  general  mental 
ability,  the  Wechsler  Adult  Intelligent  Scale 
(WAIS),  was  employed  to  accomplish  this 
objective. 


ii.  method 

Subjects 

The  subjects  were  200  non-prior  service  male 
Air  Force  basic  trainees.  One  hundred  black  and 
one  hundred  white  airmen,  at  Lackland  AFB, 
Texas,  were  randomly  selected  over  a  12  month 
period  (February  1972-February  1973)  during 
routine  testing  on  their  6th  day  of  basic  training. 
The  age  of  the  subjects  ranged  from  1 7  to  25  years 
with  a  mean  age  of  19  years  (black  19.13,  white 
18.95  years).  Education  ranged  from  8  to  16  years 
of  schooling  completed.  Mean  education  level  was 
12  years  completed  (black  11.96,  white  12.15). 


Procedure 

The  complete  Wechsler  Adult  Intelligence  Scale 
was  individually  administered  to  the  trainees  by 
the  authors.  Because  the  authors  were  members  of 
the  military  and  could  possibly  elicit  anxiety  in  the 
trainees,  military  aspects  of  the  situation  were 
deemphasized.  Each  subject  was  asked  to  divorce 
himself  as  much  as  possible  from  the  military 
situation  (i.c.,  relax,  smoke,  drink  a  beverage). 
Each  subject  was  further  told  to  do  as  well  as  he 
could,  that  he  was  in  an  experimental  situation, 
and  that  his  te^  scores  would  not  be  entered  into 
his  records  or  used  for  job  placement  in  the  Air 
Force.  Subjects  we  re  not  told  that  they  were 
taking  an  intelligence  test.  When  questions  about 
the  nature  of  the  test  <:rose,  subjects  were  told  that 
they  were  taking  a  test  of  general  mental*  abSity. 

Scores  or  the  AQE  and  AFQT  were  obtained 
from  basic  training  records  after  the  WAIS  was 
scored.  The  WAIS  provided  three  scores,  Verbal  IQ 
(VIQ),  Performance  IQ  (PIQ),  and  Full  Scale  IQ 
(FS1Q),  against  which  the  four  AQE  AIs  and 
AFQT  scores  were  contrasted. 


III.  RESULTS 

To  insure  that  the  sample  was  representative  of 
the  airman  basic  population,  a  series  of  z  tests 
was  performed  between  the  sample  and  the  total 
1972  Air  Force  accession  data.  Black  and  white 
AQE  and  AFQT  data  were  analyzed  separately. 
There  were  no  statistical  differences  between  any 
AQE  or  AFQT  means.  The  z  tests  are  summarized 
in  Tables  1  and  2. 

Means  and  standard  deviations  of  the  four  AQE 
AIs,  the  AFQT  score,  the  three  WAIS  IQs,  and 
WAIS  subteste  for  the  black  and  white  samples 
appear  in  Table  3.  The  t  tests  between  the  black 
and  white  airman  means  revealed  significant 
differences  on  all  measures  except  for  the  Digit 
Span  and  Digit  Symbol  subtests  of  the  WAIS.  In 
each  case,  black  airmen  scored  lower  than  white 
airmen.  Also  presented  in  Table  3  are  the  non¬ 
significant  tests  for  age  and  education  differences. 

A  correlation  matrix  of  AFQT,  AQE  AIs,  and 
WAIS  IQs  appears  in  Table  4  for  the  black  sample 
and  in  Table  5  for  the  white  sample.  Correlations 
were  not  corrected  for  range  restriction  effects.  All 
correlations  for  the  black  sample  were  positive  and 
significant  (p  <  .05),  except  for  PIQ  vs.  AQE-A, 
VIQ  vs.  AQE-M,  and  FSIQ  vs.  AQE-M,  where 
correlations  were  positive  but  nonsignificant.  All 
correlations  for  the  white  sample  weie  positive  and 
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Table  1.  Comparison  of  AFQT  and  AQE  Aptitude  Indexes  for  100 
Black  Enlistees  and  the  Total  1972  Black  Enlistees  Accessions 


Measure 

6!*ck 

Er.UsU* 

Sample 

1972  Black  Acc 

essions 

ZZ 

N 

M 

SC 

N 

M 

SD 

AFQT 

100 

41.68 

13.83 

10,456 

43.32 

15.57 

-1.05 

AQE-A 

100 

44.55 

16.82 

10,472 

45.90 

18.76 

-  .72 

AQE-E 

100 

45.25 

18.57 

10,472 

47.10 

18.00 

-1.03 

AQE-G 

100 

48.26 

16.73 

10,471 

50.65 

15.56 

-1.54 

AQE-M 

100 

44.20 

18.56 

10,467 

44.06 

18.29 

.08 

aFor  a  discussion  of  the  use  of  the  z  score  to  indicate  'he  likelihood  of  a 
sample  mean  differing  from  the  population  mean,  see  Hays,  Statistics ,  1963,  p. 
203*204.  In  this  situation,  M  -  m 

z  = 

o 

sj  N 

where  M  =  sample  mean,  m  =  population  mean,  o  -  population  standard 
deviation,  and  N  =  number  of  cases  in  the  sample.  Applying  Tchebycheft's 
inequality  to  the  above  z' s  reveals  the  piobabflity  of  these  sample  means  deviating 
as  much  as  they  do  from  the  population  means  approaches  1.00  for  all  tests  for 
both  racial  groups,  given  the  s'zc  of  the  groups,  and  the  population  standard 
deviations. 


Table  2.  Comparison  of  AFQT  and  AQE  Aptitude  Indexes  for  100 
White  Enlistees  and  the  Total  1972  White  Enlistees  Accessions 


Measure 

White  Enlistee  sample 

1972  White  Accessions 

a* 

N 

M 

SD 

N 

M 

SD 

AFQT 

100 

64.98 

20.67 

70,614 

64.42 

19.90 

.28 

AQE-A 

100 

56.01 

21.47 

70,919 

58.05 

20.73 

-  .98 

AQE-E 

100 

63.65 

20.06 

70,920 

65.19 

19.77 

-  .78 

AQE-G 

100 

60.90 

19.41 

70,916 

63.33 

18.31 

-1.33 

AQE-M 

100 

62.75 

18.95 

70,901 

62.28 

19.73 

.23 

aFor  a  discussion  of  the  use  of  the  z  score  to  indicate  the  likelihood  of  a 
sample  mean  differing  from  the  population  mean,  see  Hays,  Statistics,  1963,  p. 
203*204.  In  this  situation,  =M*m 

z  =  - 

o 

s/  N 

where  M  =  sample  mean.  ;n  =  population  mean,  a  =  population  standard 
deviation,  and  N  =  number  of  cases  in  the  sample.  Applying  Tchebychcffs 
inequality  to  the  above  z’s  reveals  the  probability  of  these  sample  means  deviating 
as  much  as  they  do  from  the  population  means  approaches  1.00  for  all  tests  for 
both  racial  groups,  given  the  size  of  the  groups,  and  the  population  standard 
deviations. 
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Table  3.  WA1S  Subtests  and  IQs,  AFQT  Scores,  AQE  Aptitude  Indexes, 
and  Age  and  Education  Levels  for  100  Black  and  100  White 
Air  Force  Enlistees 


Measures 

Slack  Sample 
(N=100) 

M  SO 

White  Sample 
(N=100) 

M  SD 

t 

P 

Information 

8.87 

2.08 

10.77 

2.34 

6.03 

.01 

Comprehension 

O 

/  t 

2.67 

10.83 

2.74 

4.21 

.01 

Arithmetic 

8.68 

2.09 

10.70 

2.48 

6.20 

.01 

Similarities 

9.61 

2.39 

10.46 

2.59 

239 

.05 

Digit  Span 

9.  '9 

2.76 

9.67 

2.46 

1.29 

ns 

Vocabulary 

8.74 

1.84 

10.24 

2.15 

5.26 

.01 

Digit  Symbol 

9.76 

2.16 

10.33 

2.17 

1.85 

ns 

Picture  Completion 

9.69 

1.75 

11.46 

2.40 

5.96 

.01 

Block  Design 

9.21 

2.27 

12.22 

2.66 

8.58 

.01 

Picture  Arrangement 

9.97 

2.11 

11.12 

2.45 

333 

.01 

Object  Assembly 

9.01 

2.31 

11.73 

3.08 

7.03 

.01 

Verbal  IQ 

96.74 

8.78 

104.99 

10.47 

6.02 

.01 

Performance  IQ 

97.41 

9.65 

109.30 

10.57 

8.26 

.01 

Full  Scale  IQ 

96.72 

8.33 

107.17 

9.90 

8.04 

.01 

AFQT 

41.68 

13.83 

64.98 

20.67 

9.32 

.0! 

AQE-A 

44.55 

16.82 

56.01 

2  i  .47 

4.18 

.01 

AQE-E 

45.25 

18.57 

63.65 

20.06 

6.70 

01 

AQE-G 

48.26 

16.73 

60.90 

19.41 

4.91 

.01 

AQE-M 

44.20 

18.56 

62.75 

18.95 

6.96 

.01 

Age 

19.13 

1.45 

18.95 

1.42 

.88 

ns 

Education 

11.96 

.92 

12.15 

1.00 

1.42 

ns 

Table  4.  Intercorrelations  of  WAIS  IQs,  AFQT  Scores,  and 
AQE  Aptitude  Indexes  for  100  Black  Air  Force  Enlistees 


Test 

1 

2 

3 

4 

5 

6  7  8 

1 .  WAIS  Veibal  IQ 

2.  WAIS  Performance  IQ 

.46 

3.  WAIS  Full  Scale  IQ 

.88 

.83 

4.  AFQT 

.38 

.44 

.48 

5.  AQE-Admin 

.27 

.19 

.27 

.26 

6.  AQE-Elcc 

.26 

.26 

.30 

.37 

.31 

7.  AQE-Gcn 

.23 

.29 

.30 

Jo 

.62 

.55 

8.  AQE-Mech 

.10 

.23 

.19 

.30 

.29 

.40  .63 

p  <  .05  =  .192  p  <  .01  = 

.254 
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Table  5.  Interconrelatiom  of  WAIS  IQs,  AFQT  Scores,  and 
AQE  Aptitude  Indexes  for  100  White  Air  Force  Enlistees 


Tmtt 

1 

2 

3 

4 

s 

C  7  B 

1.  WAIS  Verbal  IQ 

2.  WAIS  Performance  IQ 

.54 

3.  WAIS  Fufl  Scale  IQ 

.91 

.84 

4.  AFQT 

.64 

.62 

.71 

5.  AQE-Admin 

.60 

.28 

.52 

.51 

6.  AQE-Elcc 

.69 

.60 

.74 

.75 

.56 

7.  AQ E-Gen 

.77 

.44 

.7! 

.64 

.71 

.73 

8.  AQF-Mech 

.45 

.50 

.53 

.67 

.34 

.69  .56 

p<.05  =  .l«J5  p<01  = 

.254 

signifio'Tit  (p  <  .05).  The  intercorrelations  of 
WAIS  IQs  with  AQE  AIs  and  AFQT  scores.,  and 
AFQT  scores  with  AQE  AIs,  were  consistently 
lower  for  black  airmen  than  for  white  airmen.  A 
correlation  matrix  for  each  sample  which  includes 
demographic  data  and  WAIS  subtests  appears  in 
Tables  6  and  7. 

Regression  analyses  were  run  on  the  200  case 
sample  utilizing  the  WAIS  IQs  as  criterion  scores 
and  the  AFQT  and  AQE  AIs  as  individual 
predictors  to  determine  .acial  differences  in  test 
scores.  The  regression  design  and  models  appear  in 
Table  8.  Table  9  presents  a  summary  of  the 
regression  analyses.  Where  VIQ  served  as  the 
criterion,  significant  race  interactions  were 
indicated  for  all  AQE  AIs  as  predictors.  There 
were  no  significant  race  effects  where  AFQT 
served  to  predict  VIQ.  Where  PIQ  served  as  the 
criterion  score,  significant  level  or  intercept 
differences  were  indicated  for  the  AFQT,  AQE-A, 
and  AQE-G.  Significant  interactions  (or  slope 
differences)  were  found  for  AQE-E  and  AQE-M. 
Where  FSIQ  served  as  the  criterion  score,  signifi¬ 
cant  intercept  differences  were  found  for  AFQT 
and  AQE-A.  Significant  interactions  were  found 
when  AQE-E,  AQE-G,  and  AQE-M  served  as 
predictors  of  FSIQ. 


IV.  DISCUSSION 

Consistent  with  the  results  of  previous  studies 
where  Air  Force  selection  tests  have  been 
compared  with  other  ability  measures,  significant 
positive  relationships  were  found  between  AQE 
and  AFQT  performance  and  performance  on  a  test 
of  general  mental  ability.  Also  consistent  with 
more  recent  minority  group  test  performance 
analyses  is  the  finding  of  racial  differences  within 


Air  Force  tests.  In  the  present  study,  black  airmen 
consistently  scored  significantly  lower  than  v/hite 
airmen  on  all  Air  Force  selectors. 

Analyses  of  WAIS  performance  revealed  that 
black  and  white  enlistees  had  statistically  different 
Verbal,  Performance,  and  Full  Scale  IQs.  An 
examination  of  WAIS  subtests  shows  that  black 
airmen  scored  significantly  lower  on  all  subtests 
except  Digit  Span  and  Digit  Symbol.  Although 
mean  IQs  for  both  samples  fell  within  the  normal 
range  (90-110),  black  airmen  scored  8  to  12  IQ 
points  lower  on  the  average.  Score  variance  was 
less  than  the  theoretical  15  point  IQ  standard 
deviation  for  both  groups.  The  small  IQ  standard 
deviations  obtained  nusy  be  due  to  the  restricted 
population  of  Air  Force  basic  trainees.  Recruiting 
policy  (e.g.,  high  school  diploma  requirement)  cuts 
off  the  lower  tail  of  the  ability  distribution  of 
American  youth,  while  the  limited  appeal  of 
military  service  as  an  enlisted  man  cuts  off  the 
higher  tail,  the  college  graduates. 

The  relationships  between  the  AFQT,  AQE  AIs, 
and  WAIS  IQs  were  explored  both  by  simple  corre¬ 
lational  analyses  and  by  regression  analyses.  For 
white  enlistees,  the  correlations  show  that  more 
test  variance  is  shared  by  Air  Force  tests  and  the 
intelligence  test  than  for  black  airmen.  Regression 
analyses  employing  WAIS  IQs  as  criteria  and  selec¬ 
tion  test  scores  as  predictors  demonstrated  some 
type  of  racial  effect  in  all  but  one  case.  In  that 
case,  black  and  white  regression  lines  shared  a 
common  intercept  and  slope  when  the  AFQT  was 
used  to  predict  WAIS  Verbal  IQ.  Regression  lines 
differing  both  in  intercept  and  slope  occurred 
most  frequently  Implications  of  these  complex 
interactions  of  race  and  selector  score  are  not 
definitive,  and  would  require  an  investigation  of 
the  AQE  subtests  to  determine  what  specific  items 
or  group  of  items  are  causing  the  racial  differences. 
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Table  6.  Intercorreiations  of  WAIS  Subtests  and  IQs,  AFQT  Scores,  AQE  Aptitude  Indexes, 
and  Age  and  Education  Levels  for  100  Blade  Enlistees 
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Table  8.  Regression  Design 


Regrtosion  Models 

Model  1 .  WAIS  IQ  =  B  +  V/  +  (B  x  Se’  Test)  +  (W  x  Sel  Test) 
Model  2:  WAIS  IQ  =  B  +  W  +  Sel  Test 
Model  3:  WAIS  IQ  =  Se!  Test 


F  -  Tests 


(RpJ  -  Rr2  )/ df  j 
F  =  - 

(l-R!2)/df2 


Full 

Restricted 

Model 

Model 

Testing  for: 

Analysis  I 

Model  1 

vs 

Model  3 

Racial  Difference 

Analysis  II 

Model  1 

vs 

Model  2 

Slope  Difference 

Analysis  III 

Model  2 

vs 

Model  3 

Intercept  Difference 

Definitions 


WAISIQ  =  VIQ,  PIQ,  or  FSIQ  score 

B  =  1  if  black,  0  if  not  black 

W  =  1  if  white,  0  if  not  white 

Sel  Test  =  AFQT  or  AQE  AI  percentile  score 

Rp2  -  Squared  multiple  correlation  of  full  mode* 

Rr2  =  Squared  multiple  correlation  of  restricted  model 
Rj2  =  Squared  multiple  correlation  of  model  1 

df  j  =  Independent  vectors  in  full  model  minus  independent  vectors  in 
restricted  model 

df2  =  Elements  in  vectors  minus  independent  vectors  in  model  1 
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Table  9.  Summary  of  Regression  Analyses 


F  test*  for  Presence  of  RicUl  Difference  (slope  or  Intercept  differences)3 
Model  1  vs  Model  3 


Criterion 

Predictor 

R  2 

rr  _ 

F 

prob 

VIQ 

AFQT 

.40742 

.40058 

2 

196 

1-13 

NS 

VIQ 

AQE-A 

.35580 

.27294 

2 

196 

12.61 

.001 

VIQ 

AQE-E 

.'.1354 

.34079 

2 

196 

12.16 

.001 

VIQ 

AQE-G 

.46722 

.36258 

2 

196 

19.25 

.001 

VIQ 

AQE-M 

.25999 

.1737 6 

2 

196 

11.55 

.001 

PIQ 

AFQT 

.4761 J 

.44930 

2 

196 

5.01* 

.01 

PIQ 

AQE-A 

.29991 

.11748 

2 

196 

25.54* 

.001 

PIQ 

AQE-E 

.42403 

.31719 

2 

196 

18.18 

.001 

PIQ 

AQE-G 

.36390 

.22263 

2 

196 

21.76* 

.001 

PIQ 

AQE-M 

.37527 

.26173 

2 

196 

17.81 

.001 

FSIQ 

AFQT 

.54021 

.52540 

2 

196 

3.16* 

.05 

FSIQ 

AQE-A 

.38758 

.24062 

2 

196 

23.52* 

.001 

FSIQ 

AQE-E 

.51365 

.41136 

2 

196 

13.74 

.001 

FSIQ 

AQE-G 

.49822 

.36716 

2 

196 

25.60 

.001 

FSIQ 

AQE-M 

.38220 

.26353 

2 

196 

18.82 

.001 

aStarred  (*)  F  tests  denote  race  main  effects  (intercept  differences  but  no 
slope  difference).  All  other  significant  differences  were  shown  to  be  interaction 
effects  (both  slope  and  intercept  differences). 


Having  found  lest  performance  differences 
between  black  and  white  enlistees  when  age  and 
level  of  education  were  taken  into  account,  the 
question  may  be  raised  as  to  what  factors  con¬ 
tributed  to  differential  test  performance.  One 
possible  explanation  of  racial  differences  has  been 
described  by  Guinn,  Tupes,  and  Alley  (197Ca). 
They  have  categorized  memory,  spatial,  and 
psychomotor  skills  as  situational  abilities,  as 
opposed  to  prior  knowledge  abilities  which  are 
largely  a  function  of  an  individual's  formal  educa¬ 
tional  background.  Racial  differences  in  test 
performance  are  least  pronounced  when 
situational  abOities  are  required,  as  in  the  Digit 
Span  and  Digit  Symbol  subtests  of  the  WAIS. 
Racial  differences  in  test  performance  are  most 
pronounced  when  prior  knowledge  abilities  are 
required;  thus,  it  may  be  hypothesized  that  there 
is  an  oveibundance  of  prior  knowledge  items  th¬ 
in  both  Air  Force  selection  tests  and  the  WAIS.  An 
overbundance  of  prior  knowledge  items  may  be 
masking  the  identification  of  true  abilities  and 
may  account,  in  part,  for  the  differences  in  test 
performance  of  black  and  white  airmen. 


Future  revisions  of  selection  tests,  especially 
the  ASVAB  now  coming  into  use,  should  include 
investigations  into  the  degree  of  importance 
assigned  to  situational  and  prior  knowledge  items 
in  the  construction  of  tests.  Such  research  may 
discover  a  control  for  the  differential  quality  of 
education  of  black  and  white  recruits  who  possess 
the  same  quantitative  level  of  education  (e.g., 
years  of  education  or  grade  point  average). 

Additional  racial  test  variance  may  be  con¬ 
tributed  by  other  test  specific  factors.  The  results 
of  the  present  study  show  a  much  closer 
relationship  between  the  selection  tests  and  the 
intelligence  test  for  white  airmen  than  for  black 
airmen.  The  consistently  high  intercorrelations  of 
tsc  wnite  sample  indicate  a  good  deal  of  common 
variance  shared  ^ -'tween  “intelligence”  and  the 
indicators  of  success  in  the  .Air  Force  training 
situation.  The  much  lower  intercorrelations  of  the 
bleck  sample  suggest  that  the  selection  tests  and 
the  WAIS  are  measuring  more  divergent  factors.  A 
possible  area  of  inquiry  into  the  sou»,e  of  this 
disparity  in  test  commonality  is  the  literacy  factor. 
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The  AFQT  and  AQE  are  paper  and  pencil  tests 
requiring  the  subject  to  read  the  instructions  and 
test  items.  In  administering  the  WAIS,  however, 
both  instructions  and  test  items  are  orally 
presented  by  the  psychometrist.  Only  the  WAIS 
Vocabulary  subtest  involves  the  subject  reading 
the  items,  and  even  then  the  administrator  is 
pronouncing  the  words,  so  the  subject  is  not  solely 
dependent  upon  his  ability  to  read.  Thus,  it 
appears  that  one  approach  to  isolating  a  possible 
source  of  racial  variance  in  test  scores  would  be 
through  a  study  of  the  differential  literacy  require¬ 
ments  of  the  two  kinds  of  tests. 

A  second  source  of  racial  variance  may  be 
attributed  to  test  administrators.  The  WAIS  is 
administered  individually  by  professionally  trained 
psychometrists.  Air  Force  selection  tests,  however, 
are  group  administered  and  scored  by  recruiting 
level  personnel  who  have  had  minimal  formal 
training  in  testing.  Even  in  the  small  sample  of  the 
present  study,  so  called  recruiter  effects  in  the 
distributions  of  scores  arc  readily  apparent.  The 
clustering  of  enlistee  scores  at  or  just  above  AFQT 
and  AQE  cutoffs  for  career  field  entry  was  found 
in  this  study  and  others  conducted  hy  this  division 


(Vitola  &  Wilbourn,  1971).  For  example,  37%  of 
the  black  enlistees  in  this  study  fell  on  or  just 
above  the  minimum  AFQT  score  (31)  and  21%  fell 
at  the  AQE-G  40,  the  qualifying  cutoff.  Eighteen 
percent  of  the  white  sample  fell  on  an  AQE-M 
cutoff  of  60. 

As  a  final  note,  the  results  of  this  study  have 
purposefully  been  discussed  in  terms  of  racial 
differences  in  scores  rather  than  in  terms  of  racial 
bias.  Any  interpretation  of  the  cultural  fairness  of 
the  AQE  or  AFQT  based  upon  these  data  must  be 
tempered  by  the  fact  that  it  is  the  use  to  which  the 
test  is  put,  rather  than  the  test  itself,  which  defines 
the  fairness  of  the  test  (Thorndike,  197 1).  In  other 
words,  the  fairness  of  a  selection  test  is  determined 
by  its  application  in  the  selection  process,  not  its 
internal  characteristics.  The  present  study  only 
presents  data  in  relation  to  the  WAIS,  which  was 
utilized  as  a  baseline  for  measuring  general  mental 
ability.  Hopefully,  some  clarification  of  the  racial 
differences  in  the  AFQT  and  AQE  will  accrue 
from  this  study.  But  we  must  leave  a  definitive 
interpretation  of  racial  bias  in  selection  testing  to 
other  investigations. 
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